Multilingual Knowledge-Based Concept Recognition in Textual Data

Authors

  • Martin Schierle
  • Daniel Trabold
Abstract

As the volume of textual data available through digital resources grows, identifying the main concepts in those texts becomes increasingly important and can be seen as a vital step in the analysis of unstructured information. Research in this area has focused on the detection of named entities such as person or organization names, which cover only a very small part of the concepts in texts. In particular, the unique mapping between concepts in different languages requires parallel corpora, which are rarely available in industrial settings. We therefore propose a powerful new knowledge-based model that recognizes various kinds of concepts, even in very short and specialized texts, using linguistic information for synonym handling and word sense disambiguation. We evaluate the proposed model on texts from the automotive domain.

Similar resources

A multilingual text mining approach to web cross-lingual text retrieval

To enable concept-based cross-lingual text retrieval (CLTR) using multilingual text mining, our approach will first discover the multilingual concept–term relationships from linguistically diverse textual data relevant to a domain. Second, the multilingual concept–term relationships, in turn, are used to discover the conceptual content of the multilingual text, which is either a document contai...

Managing Multimodal and Multilingual Semantic Content

With the advent and increasing popularity of Semantic Wikis and Linked Data, the management of semantically represented knowledge has become mainstream. However, certain categories of semantically enriched content, such as multimodal documents and multilingual textual resources, are still difficult to handle. In this paper, we present a comprehensive strategy for managing the life-cycle of...

Exploiting Knowledge Bases for Multilingual and Cross-lingual Semantic Annotation and Search

The number of entities in large knowledge bases (KBs) has been increasing rapidly, making it possible to propose new ways of intelligent information access. In addition, there is an impending need for systems that can enable multilingual and cross-lingual information access. In this work, we first demonstrate X-LiSA, an infrastructure for multilingual and cross-lingual semantic annotation, wh...

Conceptual Modeling with Formal Concept Analysis on Natural Language Texts

The paper presents a conceptual modeling technique for natural language texts. This technique combines two conceptual modeling paradigms: conceptual graphs and Formal Concept Analysis. Conceptual graphs serve as semantic models of text sentences and as the data source for the concept lattice, the basic conceptual model in Formal Concept Analysis. With the use of conceptual graphs the Text ...

L2 Learners' Acquisition of English Nominal Clauses: Effects of Textual Enhancement, Metalinguistic Explanation, and Self-Regulation

This study aimed to investigate the impact of textual enhancement and metalinguistic explanation as focus-on-form tasks tending to encourage the acquisition of nominal clauses (NCs) in English. It explored (a) whether textual enhancement and metalinguistic explanation would promote and enhance the knowledge of NCs, (b) whether these two tasks would differ in terms of enhancing learners' knowled...

Publication date: 2008